XtremWeb & Condor: sharing resources between Internet connected Condor pools
Authors
Abstract
Grid computing presents two major challenges for deploying large-scale applications across wide area networks that gather volunteer PCs and clusters/parallel computers as computational resources: security and fault tolerance. This paper presents a lightweight Grid solution for deploying multi-parameter applications on a set of clusters protected by firewalls. The system uses a hierarchical design based on Condor for managing each cluster locally and XtremWeb for enabling resource sharing among the clusters. We discuss the security and fault tolerance mechanisms used in this design and demonstrate the usefulness of the approach by measuring the performance of a multi-parameter biochemistry application deployed on two sites: the University of Wisconsin-Madison and Paris South University. This experiment shows that we can efficiently and safely harness the computational power of about 200 PCs distributed across two geographic sites.
Similar resources
A worldwide flock of Condors: Load sharing among workstation clusters
Condor is a distributed batch system for sharing the workload of compute-intensive jobs in a pool of Unix workstations connected by a network. In such a Condor pool, idle machines are spotted by Condor and allocated to queued jobs, thus putting otherwise unutilized capacity to efficient use. When institutions owning Condor pools cooperate, they may wish to exploit the joint capacity of their pool...
Leveraging HTC for UK eScience with Very Large Condor pools: Demand for transforming untapped power into results
We provide an insight into the demand from the UK eScience community for very large High Throughput Computing resources and provide an example of such a resource in current production use: the 930-node eMinerals Condor pool at UCL. We demonstrate the significant benefits this resource has provided to UK eScientists via quickly and easily realising results throughout a range of problem areas. We...
Implementation of Decentralized Load Sharing in Networked Workstations Using the Condor Package
In recent years a number of load sharing (LS) mechanisms have been proposed or implemented to fully utilize system resources. We have designed and implemented a decentralized real-time LS mechanism based on the Condor package [17, 18]. Two important features of our design are use of region-change broadcasts in the information policy to provide each workstation with timely state information at mi...
Condor flocking: load sharing between pools of workstations (Report 93-104)
A selection of these reports is available in PostScript form at the Faculty's anonymous ftp-site. They are located in the directory /pub/publications/tech-reports at ftp.twi.tudelft.nl
Making Workstations a Friendly Environment for Batch Jobs
As time-sharing machines are replaced by powerful desktop computers and farms of workstations replace mainframes, more and more users turn to workstations when they need CPU cycles for their batch jobs. Unfortunately, they do not find workstations a very friendly environment for batch processing. Since these types of machines were originally designed as a single user environment, they lack most...